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sert-merge"  strategy.  That  is,  the  N  inputs  are  divided  into  2  groups  --  normally  of 
size  m  and  liNj  *  -  that  are  sorted  independently  and  then  "merged"  together  to 
form  a  single  ordered  sequence.  An  N-sorter  network  that  uses  this  strategy  consists 
of  2  smaller  sorting  networks  followed  by  a  merge  network.  The  best  merge  networks 
known  are  also  constructed  recursively,  using  2  smaller  merge  networks  followed  by  a 
simple  arrangement  of  T^nI  -  1  comparators.  Wo  consider  a  generalization  of 

+he  divide- .  ort-merge  strategy  in  which  the  N  inputs  are  divided  into  gj>2  groups  — 
of  size  N.,  N_, ...,N  where  N^+Np+.-.+N  =  R  —  that  are  sorted  independently  and 
then  merged  together?  The  merge  network  ?hat  combines  these  g  sort'd  groups  contain 
d>2  smaller  merge  networks,  where  d  is  a  common  divisor  of  N^,N_, . . . ,N  .  The  two 
parameters  g  and  d  together  define  what  we  call  a  "[g,  d]"  strategy,  ®  A  [g,d] 
N-sorter  network  contains  g  smaller  sorting  networks  followed  by  a  [g,d]  merge  net¬ 
work.  The  initial  portion  of  the  [g,d]  merge  network  consists  of  d  smaller  merge 
networks;  the  final  portion,  which  we  call  the  "f -network, "  includes  whatever  addi¬ 
tional  comparators  are  required  to  complete  the  merge.  When  g  =  d  =  2  the  f -network 
is  a  simple  arrangement  of  f^N]  -1  comparators;  however,  for  larger  g,d  the  structure 
of  the  [g,d]  f-network  becomes  increasingly  complicated.  In  this  paper  we  describe 
how  to  construct  [g,d]  f-networks  for  arbitrary  g,d.  For  N>8  the  resulting  [g,d] 
N-sorter  networks  are  more  economical  than  any  previous  networks  that  use  the  divide- 
sort -merge  strategy;  for  N  >  3^  the  resulting  networks  are  more  economical  than  previous 
networks  that  use  the  divide -sort-merge  strategy;  for  N  >3^  the  resulting  networks 
are  more  economical  than  previous  networks  of  any  construction.  Hie  [g,d]  N-sorter 
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With  a  few  notable  exceptions  the  best  sorting  networks  known  ha-'o 

employed  a  "divide- sort-merge’'  strategy.  That  is,  the  N  iriputs  are 

1  ' 

% 

divided  into  2  groups  -  -  normally  of  size  r  &Nl  and  J_^NJ  -  - 
that  are  sorted  independently  and  then  "merged"  together  to  form  a  single 
sorted  sequence.  An  N-sorRer  network  that  use.'?  this  strategy  consists 
of  2  smaller  sorting  networks  followed  by  a  merge  network.  The  best 
merge  networks  known  are  also  constructed  recursively,  using  2  smaller 
merge  networks  followed  by  a  simple  arrangement  of  (fj|N"l  -  1  comparators. 

W  consider  a  generalization  of  the  divide-sort-merge  strategy  in 
which  the  N  inputs  ;;re  divided  into  g  />  2  disjoint  groups  that  are 

sorted  independently  and  then  merged  together./  The  merge  network  that 

- — 

combines  these  g  sorted  groups  uses  d  >  2  smaller  merge  networks  as 
an  initial  subnetwork.  The  two  parameters  g  and  d  together  define 
what  we  call  a  "  [g,d]  "  strategy. 


*  Here  f  xl  denotes  the  smallest  integer  greater  than  or  equal  to  x, 
whereas  denotes  the  largest  integer  less  than  or  equal  to  x. 
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A  [g,d]  N-sorter  network  consists  of  g  smaller  sorting  networks 
followed  by  a  [g,d]  merge  network.  The  initial  portion  of  the  [g,d] 
merge  network  corjists  of  d  smaller  merge  networks;  the  final  portion, 
which  we  call  the  "f-network,"  includes  whatever  additional  comparators 
are  required  to  complete  the  merge.  When  g  =  d  =  2,  the  f-network  is 
a  simple  arrangement  of  f^N]  -  1  comparators;  however,  for  larger 
g,d  the  structure  of  the  [g,d]  f-network  becomes  increasingly  complicated. 

In  this  paper  we  describe  how  to  construct  [g,d]  f-networks  for 
arbitrary  g,d.  For  N  >  8  the  resulting  [g,d]  N-sorter  networks  are 
more  economical  than  any  previous  networks  that  use  the  divide-sort- 
merge  strategy;  for  N  >  34  the  resulting  networks  are  more  economical 
than  previous  networks  of  any  construction.  Tae  [4,4]  N-sorter  network 
described  in  this  paper  requires  jj-  N^oggN)  -  ^  N(log2N)  +  O(n) 
comparators,  which  represents  an  asymptotic  improvement  of  —  N(log2N) 
comparators  over  the  best  previous  N-sorter.  We  indicate  that  special 
constructions  (not  described  in  this  paper)  have  beei  found  for  [2r,2r] 
f-networks,  which  lead  to  an  N-soiter  network  that  requires  only 
.25  N(log2N)2  -  .372  N( loggN)  +  0(N)  comparators. 
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I.  Introduction 


A  comparator  network  with  4  inputs  is  illustrated  in  Fig.  1(a). 

Each  of  the  5  comparators,  labeled  A,  B,  C,  D,  and  E,  compares  its 
two  inputs  and  emits  the  smaller  on  its  higher  output  lead  and  the  larger 
on  its  lower  output  lead.  An  abbreviated  diagram  for  this  comparator  net¬ 
work  is  given  in  Fig.  1(b),  where  each  comparator  is  replaced  by  a  vertical 
line  connecting  the  two  comparands. 

A  comparator  network  with  N  input  and  output  leads  is  called  an 
X- sorter  network,  or  simply  an  M-sorter,  if  for  any  multiset  of  inputs 
I  =  {i^,  ±2, . . .  ,iN],  the  resulting  outputs  0  =  {opO?, . . .  ,oN}  satisfy: 

1)  0  is  a  permutation  of  I;  and  2)  o^  <  ok  if  J  <  k.  The  net¬ 

work  depicted  in  Fig.  1  is  a  4-sorter,  since  comparators  A  through  D 
move  the  smallest  input  to  o^  and  the  largest  input  to  o^,  and  then 
E  orders  the  remaining  two  inputs. 

From  an  engineering  viewpoint  it  may  be  desirable  to  use  as  few 
comparators  as  possible  when  constructing  a  network  to  sort  N  inputs. 

(An  alternative  design  objective  would  be  to  minimize  the  delay  required 
to  sort  N  items.)  Let  S(N)  represent  the  minimum  number  of  compara¬ 
tors  required  by  an  N-sorter  network.  R.  W.  Floyd  and  D.  E.  Knuth  [  2  ] 
have  determined  S(N)  for  N  <  8  by  proving  a  lower  bound  for  S(N) 
that  is  precisely  equal  to  che  number  of  comparators  actually  contained 
in  the  most  economical  N-aorter  notwork  known.  However,  for  N  >  8,  the 
value  of  S(N)  and  even  the  asymptotic  behavior  of  the  function  remain 
an  open  question.  The  strongest  lower  bound  known  for  S'H),  proved  by 

A  multiset  is  like  a  set  except  that  it  may  contain  repatltlons  of 
elements.  See  D.  I.  Knuth  [  1  ]. 


Fig.  1.  4-sorter  network. 
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D.  Van  Voorhis  [  3  ]>  increases  as  N^oggN),  whereas  the  strongest  upper 
bound  known  -  -  i.e.  the  number  of  comparators  actually  required  by  the 
most  economical  N-sorter  known,  designed  by  K.  E.  Batcher  [  4  ]  and  im- 

p 

proved  by  M.  W.  Green  [  5  ]  -  -  increases  as  N(log2N)‘. 

Batcher's  N-sorter  network  contains  b(n)  comparators,  where 

B(N)  =  *N(log2N)2  -  iN(log2N)  +  N  +  0(l).  (l) 

Although  Green  has  been  able  to  improve  upon  Batcher's  networks,  the  net 
effect  of  Green's  modification  is  simply  to  reduce  the  coefficient  in  the 
linear  term  of  Equation  (l)  from  unity  to  In  this  paper  we  present 

an  extension  of  Batcher's  constructions  which  reduces  the  coefficient  of 
N(log2N)  in  (l)  from  -  ^  to  -  i.  Our  construction  achieves  an  im¬ 
provement  of  ~  ~  N(log2N)  over  the  best  previous  networks,  although 

p 

the  asymptotic  growth  is  still  ^N(log2N)  .  We  indicate  that  a  modifi¬ 
cation  of  our  construction,  which  is  too  complicated  to  include  here, 
reduces  the  coefficient  of  N(log2N)  in  (l)  to  -.372. 


( 
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II.  The  Divide-sort-merge  Strategy 

It  is  not  always  easy  to  determine  whether  a  given  comparator  net¬ 
work  is  an  N-sorter.  For  example,  it  can  be  shown  that  the  comparator 
network  in  Fig.  2(b)  is  &  ^-sorter  whereas  that  in  Fig.  2(a)  is  not.  One 
way  to  check  a  network  is  to  see  whether  it  will  sort  all  Ml  permutations 
of  the  numbers  1,2, .  ..,N  as  inputs.  However,  the  following  important 

V 

theorem  reduces  to  2  the  number  of  input  patterns  required  to  test 
the  design  of  an  N-sorter  network. 

Theorem:  (Zero-One  Principle) 

A  comparator  network  with 
if  and  only  if  it  will  sort  all 
each  input  is  either  0  or  1. 

* 

Proof: 

The  "only  if"  portion  of  the  theorem  is  obvious;  to  prove  the  re¬ 
mainder  of  the  theorem  we  show  that  if  a  comparator  network  C  is  not 
an  N-sorter  network,  then  there  is  at  least  one  combination  of  0's  and 
l's  as  inputs  that  C  fails  to  sort. 

Suppose  that  C  is  not  an  N-sorter  network,  so  that  for  some  mul¬ 
tiset  of  inputs  T  =  (i1,i£,...,iN)  it  yields  the  incompletely  ordered 
outputs  0(1)  =  This  means  that,  although  0(l)  is  a 

permutation  of  I,  Oj  >  oJf  for  some  indices  satisfying  1  <  J  <  k  <  N. 

Now  it  is  easily  verified  (by  induction)  that  if  i(x)  is  any  non¬ 
decreasing  function  (i.e.  if  x  <  y  implies  that  t(x)  <  t(y),  )  then 


N  inputs  and  N  outputs  is  an  N-sorter 
2N  combinations  of  N  inputs  for  which 
(See  [2,5].) 


*  This  proof  was  suggested  to  the  author  by  D.  E.  Knuth. 


Fig.  4.  (nH-n) -sorter  network  T. 
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The  theorem  is  illustrated  in  Fig.  3*  The  inputs  I  =  {1,0, 1,0} 

are  applied  to  the  4  input  leads  of  each  network  in  Fig.  2.  The  first 

network  fails  to  arrange  the  inputs  into  non-decreasing  order;  therefore, 

it  is  not  a  4-sorter  network.  The  second  network  is  a  4-sorter  since  it 

will  order  properly  these  inputs  and  also  the  other  15  combinations  of 

0's  and  I's  as  inputs. 

N 

Although  2  grows  much  more  slowly  than  NI ,  it  is  not  feasible  to 

N 

test  large  networks  for  2  different  combinations  of  inputs.  Therefore, 
if  we  desire  large  sorting  networks,  we  must  build  them  in  such  a  way 
that  we  can  prove  "by  construction"  that  they  will  arrange  all  combina¬ 
tions  of  inputs  into  non- decreasing  order.  The  Zero-One  Principle  is 
helpful  in  develop! rig  such  proofs. 
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The  most  successful  strategy  for  designing  large  sorting  networks, 
suggested  by  R.  C.  Bose  and  R.  J.  Nelson  [  6  ],  has  been  to  build  then 
out  of  soaller  sorting  networks.  The  inputs  are  divided  into  two  groups 
that  are  sorted  separately  and  then  combined,  or  merged,  to  form  a  sin¬ 
gle  sorted  multiset.  This  divide- sort-merge  strategy  is  illustrated  in 
Fig.  4  by  the  N-sorter  network  T,  which  consists  of: 

i)  an  m-sorter  network  that  operates  on  the  inputs 

{i^ig,  . .  ,,im3  to  produce  the  sorted  multiset  X  = 

Cx1,X2,...,,m}; 

ii)  an  n-sorter  network,  where  n  =  N  -  m,  that  trans¬ 
forms  the  inputs  ^i^^i,^, . .  .  ,iN)  into  the  sorted 
multiset  y  =  followed  by 

iii)  an  (m,n)  merge  network  that  combines  X  and  Y  into 
the  single  sorted  multiset  0  =  {0^,0^ . . .,0^}. 

We  can  use  the  divide-sort  merge  strategy  recursively  to  achieve 
N-sorter  networks  for  arbitrary  N,  provided  we  can  construct  the 
necessary  merge  networks.  Bose  and  Nelson  suggested  building  an  (m,n) 
merge  network  out  of  three  smaller  merge  networks  arranged  in  a  pattern 
resembling  the  final  ^nree  comparators  of  the  4-sorter  in  Fig.  1.  For 
example,  when  m  and  n  are  both  even  and  m  <  n,  Bose  and  Nelson's 
(m,n)  merge  network  consists  of  the  following.  (See  Fig.  5*) 

BN1;  a  (^m,£n)  merge  network  that  determines  the  smallest 
members  of  0,  namely  o^Og,  -  •  ,,0^m;  and 
BN2:  a  (£n,^n)  merge  network  that  determines  the  largest 
members  of  0;  followed  by 


BN3:  a  ($n,^a)  merge  network  that  determines  the  remaining 


n  members  of  0. 


K.  E.  Batcher  [  4  ]  proposed  a  different  merging  strategy  which  is 


more  economical  than  Bose  and  Nelson’s,  and  which  has  not  been  improved 


upon.  The  general  (m,n)  merge  network  is  defined  recursively,  beginning 


with  the  (1,1)  merge  network,  which  is  a  single  comparator.  When  m 


and  n  are  both  even  integers  greater  than  one.  Batcher's  (m,n)  merge 


network  consists  of  the  following.  (See  Fig.  6.) 


Bl:  a  (^n,^n)  merge  network  that  combines  the  odd  members 


=  {-Vx3,*",Vl}  and  Yo  =  tyl'y3'“*'yn-l}  to 


form  the  odd  members  of  an  intermediate  multiset  V, 


namely  Vq  =  {v^, . . . ,  v^^}  ;  and 
B2:  a  (i^m,  £n)  merge  network  that  merges  the  even  members 

Xe  and  Ye  t0  f0rm  Ve  =  {VV,,,,VmJ:  followed  by 
B3:  the  £(m+n)-l  comparators  V2k+2:V2k+3,  0  5  k  <  £(m(-n)-l. 

Since  Batcher's  (m,n)  merge  network  is  the  simplest  example  of  a  more 


general  strategy  described  in  the  next  two  sections,  it  is  instructive  to 


work  through  the  proof  that  the  network  described  above  and  depicted  in 


Fig.  6  leaves  the  outputs  0  =  fo,,o.„ltJo  }  sorted. 

12  '  m+nJ 

Suppose  that  the  network  T  depicted  in  Fig.  k  consists  of  any 
m-sorter  network,  any  n-sorter  network,  and  Batcher's  (m,n)  merge  net¬ 


work.  Clearly  the  (m,n)  merge  network  orders  0  iff  t  is  an  (m+n)- 


sorter  network.  Therefore,  the  Zero-One  Principle  guarantees  that  the 


(m,n)  merge  network  orders  0  iff  T  sorts  a’l  combinations  of  m+n 


0's  and  l's  as  inputs. 


For  any  combination  of  0’s  and  l's  as  inputs  to  T,  the  111- 

sorter  sorts  X  while  the  n-sorter  sorts  Y.  The  sorted  multiset  X 

consists  of  r  0's  followed  by  m-r  l's,  and  Y  contains  s  0’s 

followed  by  n-s  l's,  where  for  different  combinations  of  inputs  to 

T,  r  and  s  assume  all  combinations  of  the  values  0  <  r  <  re; 

0  <  s  <  n.  Let  n  represent  the  number  of  0's  that  go  into  V  , 

—  —  o  o 

that  is,  the  number  of  0's  in  X  plus  the  number  in  Y  .  Let  n 

’  o  o  e 

represent  the  number  of  0's  that  go  into  V  .  Then 

e 

n  <  n  <  n  +2,  (4) 

e  —  o  —  e  *  v  ’ 

since  each  of  the  two  sorted  multisets  X  and  Y  contributes  either  the 

same  number  of  0's  to  V  and  V  or  else  one  more  0  to  V  . 

o  e  o 

After  the  odd  and  even  members  of  X  and  Y  have  been  merged  to 
form  Vq  and  Vfi,  the  following  situation  exists: 

1)  V  and  V  are  each  ordered. 

'  o  e 

2)  The  first  2n  elements  of  V  are,  therefore,  all  0. 

© 

3)  The  remaining  m  +  n  -  2ng  elements  are: 

a)  all  1  if  r.  =  n  ;  or 

o  e 

b)  0  followed  by  l's  if  n  =  n  +  1;  or 

c)  010  followed  by  l's  if  n  a  n  +  2. 

'  J  c  e 

The  elements  of  V  are  sorted  except  in  Case  c)  which  requires  an 

additional  comparator  for  the  adjacent  pair  v^  +0  •  vOQ  +^.  For 

different  combinations  of  inputs  to  T,  n^  and  n^  will  assume  all  of 

the  values  0,1, . . . ,^(m+n).  Case  c)  can  occur  for  each  of  the  possible 

values  of  ng  such  that  nQ  =  nfl  +  2  <  £(m+n).  Therefore,  the  comparators 
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listed  in  B3  above  are  both  necessary  and  sufficient  to  complete  the 
merge. 

Batcher's  merge  strategy  is  illustrated  by  the  8-sorter  network  it; 

Fig.  7.  The  10  comparators  in  Part  A  comprise  two  4- sorters  that  order 

X  and  Y.  (Note  that  each  4-sorter  consists  of  two  2-sorters,  i.e. 

comparators,  followed  by  a  (2,2)  merge  network.)  The  three  comparators 

in  Part  B  merge  Xq  =  {x^x^  and  Yq  =  {y^y,^}  to  form  Vq  «  {v^v^ 

v  ,v  ),  while  the  three  comparators  in  Part  C  comprise  a  (2,2)  merge 

network  for  X  and  Y  .  The  comparators  in  Part  D  are  those  called 
e  e 

for  in  B3,  which  combine  V  and  V  to  form  O, 

7  o  e 


III.  The  [3,3]  Merge  Strategy 

An  obvious  extension  of  the  divide-sort-merge  strategy  as  described 
above  is  to  partition  the  N  inputs  into  g  >  2  groups  that  are  sorted 
separately  and  then  merged  together.  An  N-sorter  network  that  uses  this 
g-way  divide-sort-merge  strategy  consists  of  g  sorting  networks  of  size 
N^Ng, . .  .,Ng,  where  E®^  =  N,  followed  by  an  (N^Ng, . . .  ,N^)  merge 

network.  As  an  extension  of  Batcner's  merge  strategy,  we  can  design 
g-wny  merge  networks  that  begin  with  d  >  2  smaller  g-way  merge  networks, 
where  d  is  a  common  divisor  of  N  ,N  , . . . ,N  . 

12  g 

The  two  parameters  g  and  d  together  define  what  we  shall  call 
the  [g,d]  merge  strategy.  We  say,  then,  that  Batcher's  networks  described 
in  the  last  section  use  the  [2,2]  strategy. 

A  [g,d]  (N  ,Hp,...,N  )  merge  network  consists  of  d  (N  / d, . . . ,N  /d, 

1  2  g  i  K 

merge  networks  followed  by  whatever  additional  comparators  are  required 
to  complete  the  merge.  We  shall  call  the  network  comprising  these  final 
additional  comparators  the  fg,d]  f- network.  The  [2,2]  f-network,  namely 
the  comparators  listed  in  part  B3  of  Batcher's  merge  network,  is  par¬ 
ticularly  Bimple.  In  the  remainder  of  this  section  we  illustrate  a 
procedure  for  designing  [g,d]  f-network*  for  arbitrary  g,d,  by  con¬ 
sidering  the  case  g  =  d  =  3* 

Suppose  that  we  wish  to  design  an  im,n,p)  merge  network  that  will 

combine  the  three  sorted  multisets  X  »  {xj,Xg, . . .  ,-x^} ,  Y  ®  (y^Yg,  •  •  •  •>'„ 

and  Z  *  {z^Zg,.  }  into  the  single  sorted  multiset  0  « 

o  }.  If  m,  n,  and  p  are  all  multiples  of  3,  then  the  [  .3]  merge 

m+n+pJ 

network  consists  of  the  following,  \Svv  Fig.  8.; 
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M31:  an  (>V'3>*V'3>P/3)  merge  network  that  combines  X  = 

a 

( Ya  “  I’l'V  •- V2^’  and  za  “ 

tVV-Vs)  toIor* 

M32:  an  (ra/3,n/3,p/3)  merge  network  that  combines  X  - 

-  b 

(x2»x5'-"'xin_ii>  Yb  =  1  *md  Zb  = 

tvv— **p-i}  tofor,c  vb  =  !y5 . Wil; 

M33 •  an  merge  network  that  combines  X  = 

c 

{x3>x6>*“>xm)>  =  anc^  zc  =  ^Z2’ 

z6>'“> zp}  to  form  Vc  =  {v^  vg, . . . , vffl+n+p}  ;  followed  by 
M3^:  the  [3*3]  f-network  that  we  have  yet  to  define. 

Now  the  Zero-One  Principle  guarantees  that,  without  loss  of  gener¬ 
ality,  we  may  assume  that  all  members  of  X,  Y,  and  Z  are  either  0 
or  1.  (To  see  that  this  is  so,  consider  an  (m+n+p)~ sorter  network  that 
consists  of:  an  m-sorter  that  produces  the  sorted  multiset  X;  an 
n- sorter  that  produces  Y;  and  a  p-sorter  that  produces  Z;  followed 
by  an  (m,n,p)  merge  network.)  When  all  members  of  X,  Y,  and  Z  are 

either  0  or  1,  we  find  that  the  number  of  0's  in  V  ,  V.  ,  and 
’  a’  b’ 

V  satisfies 

c 

nc  <  “t,  <  “a  <  nc  +  3‘  (5) 


Therefore,  after  the  three  3-way  merges  described  by  M31  through  M33, 


the  following  situation  exists 


and  V  are  each  ordered 


2)  The  first  3n  elements  of  V  are  all  0 


1 6 


3)  If  na  =  nb  “  ncf  then  the  remainin3  elements  of  V  are 
all  1;  otherwise,  the  remaining  elements  exhibit  one  of 
the  following  patterns  followed  by  l's. 


a) 

0 

if 

n 

= 

D 

+ 

1  =  n  + 

1; 

a 

b 

c 

b) 

00 

if 

n 

_ 

n. 

n  +  1; 

a 

b 

c  ’ 

«) 

0110 

if 

n 

_ 

n. 

+ 

2  =  n  + 

2; 

a 

b 

c 

<0 

0010 

if 

n 

a 

= 

“b 

+ 

1  =  n  + 
c 

2; 

e) 

00100 

if 

n 

_ 

n. 

_ 

n  +  2; 

a 

b 

f) 

0110110 

if 

n 

a 

= 

“b 

+ 

3  =  nc  + 

3; 

g) 

0010110 

if 

n 

a 

= 

“b 

+ 

2  -  n  + 
c 

3; 

h) 

0010010 

if 

n 

a 

= 

nb 

+ 

1  =  n  + 
c 

3; 

i) 

00100100 

if 

n 

a 

- 

nb 

= 

nc  +  3. 

It  is  readily  verified  that  patterns  c)  through  i)  are  all  sorted 
by  the  following  sequence  of  comparators. 


v3V3:v3nc+7' 

V3nc+2:V3nc4V 

V  *  V 

3“c+3‘  3V5’ 

V3nc+3:V3nc+4» 


0  <  nc  <  f  -  2; 

0  <  n  <  t  -  1; 

—  c  - 

0  <  n  <  t  —  1 ; 

—  c  - 

0  <  n  <  t  -  1, 

—  c  — 


(6) 


where  t  =  (aH-n+p)/3.  These  comparators  constitute  the  [3>3]  f-network. 

The  [3,3]  strategy  is  illustrated  by  the  12-sorter  in  Fig.  9.  The 
Inputs  are  lultially  partitioned  into  the  three  multisets  {i^igji^i^, 
i yig],  {yi8,i9},  and  {i l0,iu,i12}  that  ar®  Borted  «®P«ately. 


The  networks  required  to  sort  these  three  multisets  are  each  abbreviated 
by  double  vertical  lines  connecting  the  appropriate  comparand^.  The 
(6, 3,3)  merge  network  begins  with  three  (2,1,1)  merge  networks  that  form 
V  ,  Vfe,  and  Vc-  These  merge  networks,  which  are  abbreviated  by  a 
single  vertical  line,  are  simply  4-sorters  without  the  initial  compara¬ 
tor  connecting  the  pair  from  X.  The  remaining  11  comparators  constitute 
the  [3,3]  f-network  defined  by  (6). 


1? 


IV.  [g,d]  Sorting  Networks 

For  every  pair  of  integers  g,d  >  2  we  can  construct  N-sorte.r 
networks  using  g  small  sorting  networks  followed  by  a  [g,d]  merge 
network  which,  by  definition,  begins  with  i  small  merge  networks.  A 
sorting  network  that  begins  with  g  sorting  networks  and  d  merge  net¬ 
works  will  be  called  a  [g,d]  sorting  network,  even  if  the  g  sorting 
networks  and  d  merge  networks  do  not  employ  the  [g,d]  strategy  in¬ 
ternally.  For  example,  the  12-sorter  in  Fig.  9  is  called  a  [3>3] 
sorting  network  regardless  of  the  construction  of  the  initial  6-sorter 
and  the  small  merge  networks. 

In  order  to  facilitate  the  general  discussion  of  [g,d]  sorting  net¬ 
works,  we  adopt  the  following  conventions. 

1)  The  purpose  of  an  N-sorter  network  is  to  accept  as  input  the 

unordered  multiset  I  =  {i^,ig,  *  *  ’> *n3  ant*  to  Pro<*uce  as 
output  the  sorted  multiset  0  =  {o^o^,  * '  where  0  is 

a  permutation  of  I  and  o^  <  o^  <  ...  <  o^.  The  Zero-One 
Principle  allows  us  to  assume,  without  loss  of  generality, 
that  all  members  of  I  are  either  0  or  1.  We  make  this 
assumption  throughout  the  remainder  of  this  paper . 

2)  The  g  initial  sorting  networks,  labeled  si,s2, • • * >8gt  each 
operate  on  an  integral  multiple  of  d  members  of  1.  The  out¬ 
puts  of  these  g  sorting  networks  together  form  a  partially 
ordered  multiset  X  =  {x^x^ . .  .,xN),  where  x1  is  the  smallest 
output  from  s^  xg  is  the  second  smallest  output  from  Sj, 

. ..,  and  x„  is  the  largest  output  from  s  . 

N  ® 


i ,  *-  ^  -e-' '  ■■  ,f  *'(■ 


j  k  ‘V  -  * 
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\  th 

3J  The  j  merge  network,  1  <  J  <  d>  operates  on 
1  <  i  <  N/d  to  produce  1  <  i  <  tyd. 

4)  The  [g,d]  f- network  operates  on  V  to  produce  0. 

The  transformation  from  the  unordered  multiset  I  to  the  completely 
ordered  multiset  o  may  be  summarized  by 


g  sorting^  d  merge  >  f-network  ^ 

networks  networks 


The  [g,dj  f-network  is  defined  informally  to  be  any  network  that  will 
complete  the  ordering  of  the  intermediate  multiset  V  achieved  in  the 
[g,d]  N-sorter  network,  N  =  td.  Before  giving  a  formal  definition,  let 
us  examine  the  partial  ordering  in  V.  It  is  convenient  to  consider  V 
to  be  a  t  X  d  array,  where 


V(i,j)  -  V(i-l)d+j* 

The  t  rows  and  d  columns  of  V  are  given  by 


(7) 


(i»*) 


(*fJ) 


1  <  j  <  d 


1  <  i  <  t 


\WM)\ 


l  <  i  <  t; 

(3) 

1  <  j  <  d. 

(9) 

Note 


that  the  column  ^ y  1  <  J  <  d,  is  completely  ordered  since 


.th 


its  t  members  are  the  t  outputs  of  m^,  the  J  merge  network. 

If  the  kth  initial  sorting  network  sk  accepts  ^  0's  as 
inputs,  then  the  uniform  distribution  of  the  elements  of  X  among  the 


of  these  n,  0’s  are 
k 


d  merge  networks  guarantees  that  [  (n^+d-jj/dJ 
passed  to  merge  network  m  .  Therefore,  the  total  number  of  0's 

J 

goes  into  m^,  and  into  jj,  is  given  by 

n,  ,  =  Z  |(n+d-j)/dj,  l<j<d. 

1  i  <  k  <  g  k 


that 


(to; 


We  may  use  Equation  (10)  to  show  that 


n 


(*,a)  ; 


-  (*,1) 


(11, 


Equations  (4)  and  (5)  are  special  cases  of  (ll). 

We  have  seen  that  the  d  columns  ^  are  each  ordered.  The 

following  theorem  specifies  the  remaining  partial  ordering  in  V. 


Theorem  1: 

Consider  the  Boolean  multiset  V  =  (v.^, vg, . . . , vNJ,  where  N  =  td. 
Suppose  that  the  d  columns  given  by  (9),  are  each  ordered. 
Then 


a) 


the  t  rows  given  by  (8), 

if  and  only  if  the  number  of  0's  in 


are  also  each  ordered 
V/„  sacisfies 

(*>i) 


(*,d)  ^  n(*,d-l)  ^  **•  ^  "(*,l)’ 


<  n, 


(12) 


and 


b)  the  relation  n ^  <  n^#  d)  +  ®  implies  that 


Proof: 


a)  The  theorem  is  illustrated  in  Pig.  10.  Since  each  column  v, 

(*>j) 

is  ordered,  the  upper  n^^  elements  of  ^  ~  -  that  is, 

V(i,j)>  1  5  1  <  n(*,j)  "  “  are  a11  0  and  the  remaining 

t  "  elements  are  1.  if  we  draw  a  line  from  left  to 

right  in  V  representing  the  step  function  h(l)  a  t  -  n 
then  all  elements  of  v  above  h  are  0,  whereas  all  elements 
below  h  are  1.  Now  the  rows  are  all  ordered  iff 

no  1  appears  to  the  left  of  a  0  in  any  row.  Clearly  this  is 
the  case  iff  the  line  h(j)  separating  O's  from  l's  is 
non-decreasing,  that  is,  iff 

t-n(*,i)  ^  t'n(*,2)  ^  5  t_n(*,d)*  W 

(See  Fig.  10(b).)  Equation  (14)  is  equivalent  to  Equation  (12). 
b)  Since  v  is  Boolean 

V(5+g,l)  =  1  V(i,d)  ^  V(i-!-g,l)* 

Also,  since  ^  is  ordered. 

V(i,J)=0  <tf>  <*) 

(*,1)  -  >,<0 


Therefore,  if  n 


+  g,  then 


kb 


=  0  =>  n 


(,,i)>  l+e 


*  2 1 


*•  V(M)'° 


*  vd,o)  -  v(i+«,i) 


(17) 


Together  (15)  and  (17)  imply  that  v(t  4)  <  y(i+g 


Q.E.D. 


Since  the  columns  V(^  are  ordered,  and  since  n(#j)  satisfies 
(11),  Theorem  1  and  the  transitivity  of  the  relation  "less  than  or  equal 
to  imply  the  following  corollary. 


Corollary  1: 

Let  v  =  (vjjV2, •••,VN}>  N  =  td,  be  the  intermediate  multiset 

achieved  by  the  [g,d]  N-sorter  network  T.  Then  for  any  multiset,  of 

inputs  to  T,  V,.  <  V,  .  if 

(1*J)  -  (r,s) 

a)  r  >  i  and  s  >  j;  OR 

b)  r  >  i+g. 

The  partial  ordering  in  V  is  illustrated  in  Pig.  11,  for  the  case 
g  =  3,  d  »  4,  t-6,  with  an  arrow  from  V(m)  to  j  representing 

the  relation  <  V,r^j.  R.  w.  pioyd  has  pointed  out*  that  the 

partial  ordering  in  v  is  exactly  characterized  by  Corollary  1  and  Pig.  11. 

*  Private  c Jmmunlcation. 


mm 
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By  this  we  mean  that  it  V  =  {v^ v^, . . . , v^}  is  any  Boolean  multiset  that 

satisfies  the  partial  ordering  specified  for  V  by  Corollary  1,  then 

A 

there  is  at  least  one  combination  of  inputs  to  T  that  achieves  V  =  V. 

The  sublety  of  this  observation  is  best  illustrated  by  a  partial  ordering 
that  is  not  exactly  characterized.  Consider  the  comparator  network  that 
results  from  removing  comparators  D  and  E  from  the  4-sorter  in  Fig.  1. 

The  partial  ordering  in  the  multiset  0  does  not  include  either  o^  s  c„. 
or  ^  o^.  since  0([0,l,0,l})  =  {0,1,0,!}  and  0([l, 1,0,0})  =  {0, 1,1,0}. 
However,  no  combination  of  inputs  will  achieve  0  =  {0, 1,0,0} . 

We  have  defined  a  [g,d]  f-network  informally  as  a  network  that  will 
conplete  the  ordering  of  the  intermediate  multiset  V  achieved  in  the 
[g,d]  N-sorter  network,  N  =  td.  The  following  is  a  more  formal 
definition. 


Definition  1: 

A  sequence  of  comparators  is  called  a  [g,d]  f-network  for  N  =  td 
items  if  and  only  if  it  will  complete  the  ordering  of  the  multiset 
V  =  £v1, v2, ... ,VN),  when  a)  the  columns  V ^  ^  of  V,  given  by  (9),  are 
each  ordered  and  h)  n,  x  satisfies  (ll). 

We  can  construct  [g,d]  f-networks  for  arbitrary  g,d  by  i)  using 
(ll)  to  determine  what  un&orted  patterns  of  0's  and  l*s  remain  in  V; 
and  ii)  finding  a  sequence  of  comparators  that  will  order  these 
unsorted  patterns.  Following  this  procedure  we  have  derived  f-networks 
for  g,d  £  4;  the  best  f-networks  obtained  are  tabulated  in  Table  1. 


Strategy  f-network  for  N-sorter,  N  =  td 


A 

fCs,cT] 


(N) 


[2,2] 

V(i,2):V(i+l,l)'  1  <  1  5  t_1* 

-  1 

[2,3] 

V(i,2):V(i+l,l)'  1  5  1  <  t_1; 

V(±^3) iV(i+l,2)^  1  <  1  <  t-l; 

V(i,3):V(i+l,l)' 

N  -  3 

[2,4] 

v(i,3):V(i+i,i)'  1  <  1  <  t-i; 

V(i,4) :V(i+l,2)*  1  -  1  -  t_1; 

v(i,2):v(i,3)»  1  <*■<*> 

V(i,4):V(i+l,l)>  1  -  1  -  t_1* 

N  -  3 

[3,2] 

V(i,2):V(i+l,l)'  1  -  1  -  t_1; 
V(i,l):V(i,2)'  2  -  1  -  t“2* 

N  -  3 

[3,3] 

V(*,3):V(i+2,l)'  1  -  1  -  t-2; 

V(i,2):V(i+l,l)>  1  <  i  <  t-1; 

V(i,3):V(i+l,2)^  l<i<t-l; 

V(i,3):V(i+l,l)'  l<i<t-l. 

¥-> 

Table  1.  Small  f-networks 


Strategy 


f-uetwork  for  N-sorter 


V(lA):V(3,l ); 

v(t-2,4):V(t,i): 

v(i,3):V(V0; 

V(3,l):V(3,2); 

v(t-2,3):V(t-2,4); 

[3A] 

V(t,l):V(t,2); 

V(i,3):V(i+2,l)^ 

2  <  i  <  t-3; 

2N-12,  K=12 

2N-11,  N>12 

V(i,4):V(i+2,2)' 

2  <  1  <  t-3; 

V(i,2):V(i+l,l)> 

1  <  i  <  t-2; 

v(i,4):V(l+l,3)> 

2  <  i  <  t-1; 

V(i>3)  :V(i+3-x 

i  <  !  <  t-1; 

V(i,4):V(i+l,2)' 

1  <  i  <  t-1; 

V(i^ 2) * V( 3 ) ^ 

2  <  i  <  t-1; 

v(i,iO:V(i+i,i)' 

1  <  i  <  t-1. 

[4,2] 

V(i,2):V(l+2,l)' 

V(i,2)sV(l+l,l)' 

1  <  i  <  t-2; 

1  <  1  <  t-1. 

N  -  3 

Table  1.  (cont)  Small  f-networks 


’Hmwm 
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Except  for  the  [3,4]  and  [4,3]  f-networks,  each  of  the  tabulated 
f-networks  is  completely  described  by  a  sequence  of  templates  of  the  form 

V(i,a):V(i+y,3)  Where  1  <  <  d>  Y>0,  and  a  <  yd  +  3  — 

followed  by  a  range  for  i,  which  is  specified  in  terms  of  t  =  N/d.  The 

[ 3 ,4]  and  [4,3]  f-networks  are  described  by  several  specific  comparators, 
in  addition  to  templates.  Note  that  when  N  =  12,  half  of  these  specific 


comparators  are  redundant  and  may  be  eliminated.  For  example,  the  second 

comparator  listed  for  the  [3,4]  f-network,  namely  v (t_2)k):V be~ 

comes  V/,  .  s,  which  is  the  same  as  the  first  comparator  listed. 

\1,4J  (3,1/ 

Let  fr  n(N)  represent  the  minimum  number  of  comparators  required 
Lg,d] 

by  a  [g,d]  f-network  for  N  items.  (Note  that  this  function  is  only  de¬ 


fined  when  N  is  a  multiple  of  d. )  Since  we  have  not  proved  that  the 


tabulated  f-networks  are  minimal,  we  have  labeled  the  number  of  compar¬ 
ators  that  they  require  f^g  d-j(N).  For  each  of  the  tabulated  f-networks, 
except  the  [3,4]  and  [4,3]  f-networks,  we  find  that 


ar  .,N 
[g,d] 


W]' 


where  a|-g  ^  is  (l/d)  times  the  number  of  templates  and  brgjd]  is 
constant.  The  tabulated  [3,4]  and  [4,3]  f-networks  are  also  described  by 


(l8)  for  N  >  12. 

For  large  g,d  it  becomes  increasingly  difficult  to  derive  an  eco¬ 
nomical  [g,d]  f-network,  since  the  number  of  patterns  of  0's  and  l's 
allowed  by  (ll)  increases  rapidly.  Let  P(g,d)  represent  the  number  of 
patterns  of  0's  and  l's  consistent  with  (ll),  that  is,  the  number  of 
different  combinations  of  values  that  a(*f2 )*  *  n(*,d-l)  Can 
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assume  for  each  value  of 
we  observe  that 


n(*,dr 


Witn 


n(*,j) 


abbreviated  by 


> 


P(g,d)  =  Z  S  ...  Z  1.  (19) 

VWg  WrV2 

We  may  obtain  a  recurrence  relation  for  P(g,d)  by  noting  that 

P(g,d)  =  Z  Z  ...  Z  1 

VW*-1  W"i  Vnd-i£nd-2 

+  £  ...  Z  1 

“dW  ndSVlSV2 

=  P(g-l,d)  +  P(g,d-l).  (20) 

The  solution  to  (20),  with  the  boundary  conditions  P(l,d)  =  d,  P(g,l)  =  1, 
is  simply 

(2!) 

Note  that  (2l)  yields  P(2,2)  =  3  and  P(3,3)  =  10,  which  agrees  with 

our  analysis  of  the  [2,2]  and  [3,3]  merge  networks. 

When  N  >  gd,  the  problem  of  designing  an  f-network  that  will  order 

P(g,d)  patterns  of  0's  and  l's  represents  a  considerable  reduction  of 

N 

the  original  problem  of  designing  an  N-sorter  network  that  will  order  2 
different  input  patterns.  However,  for  large  g,d,  we  find  that  P(g,d) 


32 


becomes  too  large  to  permit  an  exhaustive  test  of  a  proposed  design  for 
a  [g,d]  f- network.  Therefore,  for  large  g,d  we  must  build  f- networks 
in  such  a  way  that  we  can  prove  "by  construction"  that  they  complete  the 
ordering  of  V.  Suitable  procedures  for  constructing  large  [g,d] 
^-networks  are  given  *n  the  next  section. 
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V.  Constructing  Large  [g,d]  f-networks 

Our  approach  to  the  problem  of  deriving  large  sorting  networks  and 
large  [g,d]  merge  networks  is  to  build  them  out  of  smaller  sorting  net¬ 
works  and  smaller  merge  networks.  We  use  the  same  approach  to  the 
problem  of  designing  large  [g,dj  f-networks.  We  will  present  two 
construction  metiods  in  tho  form  of  theorems.  Theorem  2  below  describes 
a  procedure  for  constructing  a  [g,sd]  f-network  using  d  small  [g,s] 
f-networks  and  one  [g,d]  f-network.  Theorem  3  describes  a  similar 
procedure  for  building  an  [sg,d]  f-network  out  of  s  small  [g,d]  and 
one  [s,d]  f-networks.  We  may  use  these  constructions  and  the  f-netwerks 
given  in  Table  1  to  achieve  f-networks  for  arbitrarily  large  g,d. 

Before  giving  the  theorems,  we  will  deserve  an  example.  Suppose 
we  desire  to  construct  a  [3,6]  f-network  for  the  [3,6]  l8-sorter  net¬ 
work.  The  partial  ordering  in  the  intermediate  multiset  V  is  depicted 
in  Fig.  12(a).  In  Fig.  12(b)  we  have  isolated  the  partial  ordering  in 
the  even  members  of  V.  Clearly  a  [3,3]  f-network  will  order  Vj 
similarly,  another  [3,3]  f-network  will  order  Vo- 

The  partial  ordering  depicted  in  12(a)  guarantees  (by  Theorem  l) 


that 

°(*,6)  i  °(*,5)  -  '  •  '  5  "(.,1)  5  n(*,6)  +  3- 

The  number  of  0's  in  V  and  V  are  given  by 


(22) 


no  =  n(*,i)  +  n(*,3)  +  V,5): 

De  =  V,2)  +  “(M)  +  V,6)' 
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so  that 


0  <  <n(*,5)  ‘  n(*,6)}  +  (n(*,3)  "  “(M)5  +  (n(*,i)  '  ■ 


=  no  “  ne  <  n(*,i)  "  n(*,6)  *  3- 


(24) 


Therefore,  a  [3,2]  f- network  will  complete  the  ordering  of  V,  once  Vg 

and  V  have  each  been  ordered, 
o 

Since  the  two  small  [3,3]  and  one  full-sized  [3,2]  f-network  will 
complete  the  ordering  of  V,  they  together  constitute  a  [3*6]  f-network. 
The  resulting  [3,6]  l8-sorter  network  is  given  in  Fig.  13. 

For  Theorems  2  and  3  below  it  is  convenient  to  consider  the  multiset 
V  =  cv1,v2, ...,vN)  ,  N  =  pqr,  to  be  a  p  X  q  X  r  array,  where 

(25) 

V(i, j,k)  =  v(i-l)qr+(j-l)r+k* 

Submultisets  of  V  include  the  pq  "rows,"  pr  "columns,"  and  qr 
"verticals"  defined,  respectively,  by 


KJ  | 

1  <  k  £  r  j 

-  ^  | 

[y  .) 

(i,*,k)  1  <  J  <  q  j 

l 

lT  A 

(*,j,k)  "  1  <  i  <  p  j 

k  <  r;  (26) 


Larger  submultisets  of  V  include  the  p  q  X  r  "planes",  the  q  p  X  r  planes, 
and  the  r  p  X  q  plar.es  defined  by 
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v(w)  =  i<T^,{V(,’J’k)}’  lsk-ri 

V(* >*)  =  ^  ^  |vfi  i  k)l#  1  5  ^  5  q;  (27) 

k  ’  i<i<p  1  <  k  <  r  (  1  'j 
V<W)  *  1  !>?<  r  {w>J'  1  -  1  " 


For  example,  if  we  consider  the  intermediate  multiset  V  for  the  [3,6] 
l8- sorter  (Fig.  12)  to  be  a  3x3x2  array,  then 


¥(1,2,1)  "  v3: 
v(*,2,i)  “  {v3»v^vi5); 


V,„  „  . ^  =  V  . 


We  are  now  ready  for  Theorems  2  and  3- 

Theorem  2: 

Let  the  multiset  V  =  {y1,v2, where  N  =  tsd,  be  considered 
a  t  x  s  x  d  array.  Then  the  following  small  f-networks  togethor 
constitute  a  [g,sd]  f-network  for  V. 

i)  d  [g,s]  f-networks  for  V^#  ^  ^y  1  <  k  <  d;  by 

ii)  one  [g,d]  f-network  for  V. 
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Proof : 

According  to  Definition  1,  the  sequence  of  comparators  represented 

by  i)  and  ii)  is  a  [g,sd]  f-network  for  V  if  and  only  if  it  will 

complete  the  ordering  of  V  given  that:  a'*  the  planes  V 

*  f;  3  £  s>  l<k<d,  are  ordered;  and  b)  n  satisfies 

*  j  i  k  ; 


(*,S,d)  <  n(*,S,d-l)  -  -•  <  V,s,  1)  <  n(*,3-l,d)  < 


“(*> s-l,d-l)  <  n(*,i,i)  5  n(*,s,d)  +  g> 


(0><) 


Let  us  assume  that  the  partial  ordering  n  V  satisfies  con¬ 
ditions  a)  and  b).  Then  since  the  submultisets  V/,  •  ,  \  of  v,  ,  ,  \ 

{*>*>*<' 

are  ordered  and  since  n,  .  satisfies  (29)  a  [g,s]  f-network  will 
order  Now  the  number  <f  C’s  in  ^  ^  , both  before 

and  after  the  application  of  the  [g,s]  f-network)  is  given  by 


n 


(*,*,k) 


E 

1  <  j  <  s 


(30) 


For  any  two  indices  k  ,k  satisfying  like  k  <  d.  we  mav  use 

1  d.  “  1  Cl  — 

(29)  to  show  that 


0  <  E  (n,  .  ,  \  -  n  .  .  , 

"  1  <  j  <  s  (*'J'kl 


n  .  ,  -  n 


-  ”(*,1,1)"  ”(*,»,d)-g  • 


i31; 


Therefore,  once  the  [g,sl  f-networks  have  ordered  the  planes  V  , 

;+,*,k 

(3-0  guarantees  that  a  [g,d]  f-netwovk  will  complete  the  ordering  of  V. 
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We  have  seen  that  if  the  partial  ordering  in  v  satisfies  con¬ 
ditions  a)  and  b),  then  the  d  [g,s]  f-networks  in  i)  followed 
by  the  [g,d]  f-network  in  ii)  will  complete  the  ordering  of  V. 
Therefore,  i)  and  ii)  together  constitute  a  [g,sd]  f-network. 

Q.E.D. 


Theorem  3s 

Let  V  be  as  in  Theorem  2.  Then  the  following  small  f-networks 
together  constitute  an  [sg,d]  f-network  for  V. 

i)  s  [g,d]  f-networks  for  V/„  .  1  <  j  <  s:  followed  by 

J  —  — 

ii)  one  [s,d]  f-network  for  V. 


Proof: 

The  proof  of  Theorem  3  is  similar  to  that  for  Theorem  2  and  is 
given  in  Appendix  A. 

The  partial  ordering  in  the  intermediate  multiset  V  for  the  [3-2,3] 
18-sorter  is  given  in  Fig.  14.  The  construction  method  described  by 
Theorem  3  requires  a  [3,3]  f-network  connecting  the  three  odd  rows 
of  V  (V/„  ,  .n)  and  a  [3,3]  f-network  for  the  even  rows  (V,  ), 

followed  by  a  [2,3]  f-network.  The  resulting  [b,3]  l&-sorter  network 
is  given  in  Fig.  15. 

We  many  count  the  comparators  required  by  the  f-networks  constructed 
according  to  Theorems  2  and  3  to  obtain  the  following  important  corollary. 
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Corollary  2: 

W*  2 d-'  *  W); 

(32) 

f[sg,d]W  S  °'1  [g,d](>l/s)  +  f[s,d](K)- 

(33) 

The  inequality  is  required  in  Corollary  2  since  we  have  no  guarantee 
that  an  f- network  constructed  using  Theorem  2  or  Theorem  3  is  minimal. 

In  fact,  the  [3,6]  f-network  exhibited  by  the  l8-sorter  in  Fig.  13 
is  not  the  most  economical  [3,6]  f-network  known.  We  may  use  Theorem  2 
with  s  =  2,  d  =  3>  to  build  a  [3>6]  f-network  out  of  three  [3,2]  and 
one  [3,3]  f-networks.  Using  Table  1  we  see  that  this  f-network  requires 

WN>  ■ 3  W"'3)  *  W”  ■  5  ’  ■ m 

7 

comparators,  whereas  the  f-network  in  Fig.  13  requires  ^  N  -  13  =  29 
comparators.  (However,  a  slight  modification  of  the  [6,3]  f-network 
illustrated  in  Fig.  13  reduces  the  number  of  comparators  to  i  N  -  is.) 

The  number  of  comparators  required  by  the  best  f-network  that  can 
be  constructed  out  of  smaller  f-networks  using  the  construction  of 
Theorem  2  and/or  Theorem  3  is  neatly  summarized  by 


[i*,d] 


(N)  = 


min 

1  <  q  <  g 

g  mod  q  =  0 


min 

1  <  p  <  d 

2  <  q  +  p 
d  mod  p  s  0 


*(«»a,H»q,p) 


(35 


where 


F(g,d,N,q,p)  =  ‘»’P*f[g/q,d/p](N/(<,-p))  +  f[q,p](N) 
A  A 

+  +  p'Wp](*'p)  ' 


(36) 
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A  ,  .  A 

Note  that  fr  _(N)  =  f-  (n)  =  0,  so  that:  a)  if  q  =  1,  then  (36) 
LS>AJ  Li>aJ 

describes  a  construction  that  uses  only  Theorem  2;  b)  if  p  =  1  then 
(36)  describes  the  use  of  Theorem  3  alone;  and  c)  if  p,q  >  1  then  (36) 
describes  a  network  built  using  both  theorems.  The  case  p  =  a  =  1 
is  not  allowed,  since  it  would  reduce  (35)  to  an  identity. 

We  may  use  the  construction  methods  of  Theorems  2  and  3,  along 
with  the  f-networks  in  Table  1,  to  achieve  [g,d]  f-networks  for  all 

1  j  * 

g,d  of  the  form  2  3  •  When  g  and/or  d  have  prime  factors  greater 
than  3,  we  may  construct  a  [g,d]  f-network  as  follows:  Using  Batcher's 
general  method  we  obtain  a  [2, d]  (2d)- sorter  network.  This  (2d)- sorter 
will,  of  course,  exhibit  all  of  the  templates  required  by  any  [2,d] 
network.  We  may  use  the  [2,d]  f-network  and  Theorem  3  to  derive  a 
[2  ,d]  f-network,  where  g  <  2*.  From  Definition  1  it  is  clear  that  a 
[21,d]  f-network  is  also  a  [g,d]  f-network  for  all  values  of  g  <  21, 

(This  is  because  the  unsorted  patterns  remaining  in  the  intermediate 
set  V  for  the  [g,d]  sorter  are  a  subset  of  those  remaining  in  the 
[2i,d]  network,  if  g  <  21. )  Therefore,  we  may  construct  [g,d]  f-networks 
for  arbitrary  g,d. 

We  will  conclude  this  section  by  calculating  the  number  of  compara¬ 
tors  required  by  the  [gSd^]  f-network  constructed  using  Theorems  2 
and  3.  From  Equation  (35)  we  obtain 


*  Note  that  these  construction  techniques  are  illustrated  by  networks 
in  Table  1:  the  [2,4]  f-network  illustrates  Theorem  2,  while  the 
[4,2]  f-network  illustrates  Theorem  3« 
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V,dJ] 


min  min 

0  <  r  <  i  0  <  s  <  j 

0  <  r  +  s 


F(g,d,N,gr,dS) 


(37) 


Since  *£g>dj(N)  is  linear  in  N  for  all  of  the  tabulated  f-networks, 
we  expect  a  solution  to  (37)  of  the  form 


f  i  4  (N)  =  a  N  -  b 

u  V:  u V)  ceV] 


(38) 


Usx..g  (36)  and  (38)  in  (37)  we  obtain 


i  J- 


min 


min 


[g  »dJ]  0  <  r  <  i  0  <  s  <  j 

0  <  r  +  s 


V‘V"S] +  a[gr,ds] 


+  a  ,  +  a 

[8  ,<■“]  uV"] 


i  (39) 


b  ^  .  =  max  max 

[g,dJ]  0  <  r  <  i  0  <  s  <  J 

0  <  r  +  s 


r  s 

g  d  b  .  .  +  b 

r  i-r  , J-3-,  r  r  s. 
U  >d  ]  [g  ,d  ] 


+  grb  .  +  dSb 

r  i-r  .s-  r  r  .J-S- 

[k  >d  1  C«  >d  1 


(40) 


Equations  (38)-(40)  describe  the  number  of  comparators  required 
by  a  [g1^]  f-network  built  out  of  smaller  f-networks  using  Theorem  2 
and/or  Theorem  3,  Most  of  the  best  [g^d^]  f-networks  known  exhibit 
this  construction  a*-  .  are,  therefore,  described  by  (38)-(40). 


For  many  values  of  g,d,  all  of  the  best  [gr,dS]  f- networks  known 
(0  <  r  <  i  ,  0  <  s  <  j,  0<r+s)  are  constructed  from  the  [g,d] 

f-network  by  repeated  use  of  Theorems  2  and  3.  In  this  case  the  solutions 
to  (39)  and  (40)  are 


[gVi 


=  i*J*a 


=  b 

(g-l)(d-l)  [g,d] 


(41) 


(42) 


From  Table  1  we  observe  that  ar.  ,  -  =  2,  br,  ,  .  =  11,  whereas  from 

[4,4]  [4,4]  ' 

Equations  (4l)  and  (lf2)  (evaluated  with  g=d=i=j=2)we  find  that 

the  [4,4]  f-network  constructed  from  the  [2,2]  f-network  according  to 

Theorems  2  and  3  requires  a^^  =  ^2,2]  =  2'  b[4,4]  “  9b[2,2]  =  9’ 

The  [4,4]  f-network  given  in  Table  1  is  the  smallest  example  of  a 

k  k 

special  procedure  which  has  been  discovered  for  constructing  [2,2] 

f-networks  [  7  ]•  The  special  procedure  is  too  complicated  to  include 

in  this  paper.  Basically,  it  requires;  a)  determining  the  templates 
k  k 

required  by  the  [2  ,2  ]  f-network  derived  using  Theorems  2  and  3;  and 
b)  reordering  these  templates  in  such  a  manner  that,  although  the  result¬ 
ing  network  still  orders  V,  some  of  the  comparators  have  bscome 
"redundant"  and  may  be  removed.  Since  the  special  construction  does  not 

reduce  the  number  of  templates,  a  k  .  is  given  by  (4l),  evaluated  with 

[2  , 2  J 

g  =  d  b  2  and  i  =  j  -  k.  However,  the  constant  term  is  increased  to 


„  . 

[sV]  3 


3  2k  *  §  , 


(43) 
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VI.  The  Economy  of  [g,d]  N-sorter  Networks 

We  have  defined  a  [g,d]  N-sorter  network  to  consist  of  g  sorting 
networks  of  size  . . . ,N^,  where  N^  is  an  integral  multiple  of 

d  and  N^  =  N,  followed  by  a  [g,d]  . . . ,Ng)  merge  network. 

Since  N.  is  required  to  be  a  multiple  of  d,  we  cannot  construct  a 
[g,d]  N-sorter  network  unless  N  is  a  multiple  of  d.  This  limitation, 
which  was  included  since  it  greatly  simplifies  the  description  of  [g,d] 
merge  networks  and  [g,d]  f-networks,  can  be  removed.  In  [  3  >  5  ]  a 
procedure  is  given  for  pruning  an  N-sorter  network,  that  is  eliminating 
one  input  lead,  one  output  lead,  and  several  comparators,  to  achieve  an 
(N-l)-sorter  network.  For  arbitrary  N  we  can  use  the  [g,d]  strategy 
to  achieve  an  N-sorter  network  by  1)  deriving  the  [g,d]  (dfN/dl)- 
sorter  network  and  2)  pruning  as  necessary.  If  we  extend  the  defi¬ 
nition  of  a  [g,d]  N-sorter  network  to  include  the  sorting  networks 
achieved  by  pruning  a  [g,d]  sorting  network,  then  for  all  values  of  N 
txcept  N  =  10,13,14,15,16,  or  18,  the  most  economical  N-sorter  known 
is  a  [g,d]  sorting  network. 

We  can  also  use  pruning  to  achieve  a  [g,d]  (N^,Ng, ...,N  )  merge 

network  when  not  all  of  the  N^  are  integral  multiples  of  d.  Let 

Mr  , (n  , N„ ..... N  )  represent  the  number  of  comparators  contained  in 
[g,d] '  1'  2’  ’  s' 

the  (N^Ng, . . .  ,Ng)  merge  network  achieved  by  pruning  (if  necessary)  the 
[g,d]  (df N1/d],df Ng/d], . . . ,df N^/d] )  merge  network.  Then  the  minimum 
number  of  comparators  required  by  a  (N^N^, . .  •  >Ng)  merge  network 
structed  using  any  [g,d]  strategy  io  given  by 


con- 


Mg(N1,N2,...,Nf)  =  min  * *»V* 


(44) 
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It  is  instructive  to  ask  which  values  of  g  and  d  yield  the  most 
economical  N-sorter  networks.  Let  S^(n)  represent  the  number  of  com¬ 
parators  required  by  the  most  economical  N-sorter  that  uses  g  sorting 
networks  followed  by  a  [g,d]  merge  network.  In  order  to  permit  a  valid 
comparison  of  networks  achieved  with  different  values  of  g,  we  will 
require  that  each  of  the  g  initial  sorting  networks  must  itself  use 
the  g-way  divide-sort-merge  strategy,  so  that  S^(N)  satisfies  the 
recurrence  relation 


VB)  ■ 


min  {  M  +  Z  «_(*.)}.  (45) 

N,+...+N  =N  8  Bl<l<g* 

1  g  -  -  6 

N  >  ! 


We  have  calculated  S  (N)  for  g  =  2,3,  and  4  and  N  <  3 6;  the 

*  A 

results  are  given  in  Table  2.  The  last  column,  labeled  S(n),  gives 

the  number  of  comparators  contained  in  the  most  economical  N-Borter  known 

of  any  construction.  An  asterisk  indicates  those  values  of  s(N)  which 

* 

represent  an  improvement  over  the  most  economical  networks  previously 
reported  [  5  ]• 

From  Table  2  we  observe  that  S^(N)  is  only  occasionally  smaller 
than  Batcher's  result,  B(N)  =  S^N).  However,  S^(N)  <  S^N)  for  all 
N  >  8,  and  the  [4,d]  N-sorter  networks  are  more  economical  than  any 
previous  N-sorter,  for  N  >  34. 


*  The  improved  l8~sorter,  which  does  not  use  a  [g,d]  strategy,  is  given 
in  Fig.  16.  The  improved  26-, 27-, 28-,  and  34-sorters  all  use  two 
initial  sort  units,  one  of  them  the  particularly  efficient  16-sorter 
designed  by  M.  W.  Green,  followed  by  Batcher's  [2,2]  merge  network. 
The  best  35-sorter  is  achieved  by  pruning  one  lead  from  the  [4,9] 
36-sorter;  the  oest  36-sorter  uaws  the  [3,12]  strategy. 
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A 


N 

g  =  2 

?  ■  2 

g  =  4 

8(H) 

2 

1 

1 

1 

1 

3 

3 

3 

3 

-  3 

4 

5 

6 

5 

5 

5 

9 

9 

9 

9 

6 

12 

12 

12 

12 

7 

16 

17 

16 

16 

8 

19 

21 

19 

19 

9 

26 

25 

25 

25 

10 

31 

32 

30 

29 

u 

37 

37 

35 

35 

12 

4i 

42 

39 

39 

13 

48 

51 

47 

46 

14 

53 

57 

52 

51 

15 

59 

62 

57 

56 

1 6 

63 

70 

61 

60 

17 

74 

76 

73 

73 

18 

82 

81 

80 

79* 

Table  2. 

8  (H)  for  g  <  4, 

N  <  36. 

)0 


N 

8  =  2 

g  »  3 

g  «  4 

*JS) 

19 

91 

93 

89 

88 

20 

97 

101 

95 

93 

21 

107 

108 

104 

103 

22 

11  + 

117 

110 

HO 

23 

122 

125 

118 

118 

24 

127 

131 

123 

123 

25 

138 

14 1 

135 

134 

26 

146 

148 

143 

141* 

27 

155 

154 

i5i 

150* 

28 

161 

168 

157 

156* 

29 

171 

178 

168 

166 

30 

178 

187 

174 

172 

31 

186 

197 

182 

180 

32 

191 

20  i 

187 

185 

33 

207 

214 

203 

203 

3^ 

219 

226 

214 

213* 

35 

232 

234 

225 

225* 

36 

241 

241 

233 

232* 

Table  2  (cont.) 


*  This  value  of  8(N)  describes  an  N-sorter  network  that  Is  more 
econonlcal  than  any  N-sorter  previously  reported  [  5  ]. 


We  may  discover  the  asymptotic  growth  of  S  (N)  by  considering  the 
k+1 

case  N  =  g  .  For  all  values  of  g  tried  we  have  found  that  the 


minimum 


so  that 


in  the  right-hand-side  of  (45 )  occurs  when  N  =  Ng  *  . . .  =  g  , 


„  ,  k+lN  „  ,  ks  ,,  ,  k  k  kx 

3g(«  )  =  8  sg(g  )  +  Mg(K  >«  >8  )• 


k  lc  k 

We  have  also  found  that  the  most  economical  [g,d]  (g  ,g  ,...,8  )  merge 
network  known  is  achieved  using  d  =  g,  so  that 


Hg(gk,-«->gk)  »  M[g,g](gk,*"'gk) 


„  .  k-1  k-l>  'I  ,  k+1. 

g  Mg(g  )  +  f[g,g](g 


The  solutions  to  the  recurrence  relations  given  i  (47)  and  (46), 

A 

with  the  boundary  condition  S^(g)  =  M  (1,1, . . .,1)  =  S(N),  are 


where 


“g(sk,-.-,*k)  *  (a*gk  +  +  »,.)  «k+1  +  U-'b,; 


sg(«k)  -  (cy<2  +  Pgk  +  Yg)  ek  -  vg. 


\  -  «'1s(«)  -  *•[,,,]  -  I  V1)*1 ‘51) 
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(F  -;)"2  b. 


[gig]* 


The  asymptotic  growth  of  S  (n)  may  be  obtained  from  (49);  it  is 


given  by 


s  (n)  =  a  N(iog  n)  +  3  N(iog  n;  +  v  n  +  o(i) 

6  g  g  S  S  g 


=  ag(log2g)_2  N(log2N)2  + 


e  (log2g)“  N(log2N)  +  v  H  +  0(1). 


We  may  obtain  the  coefficients  ar  ,  and  br  ,  from  Table  1  and  use 

LgjgJ  Lg>gJ 

them  in  Equations  (49)- (53)  to  show  that 


S2(N)  =  ^  N(log2N)2  -  ^  N(log2N)  +  N  +  0(l); 


S3(N)  =  .265  N(log2N)*i  -  .315  N(log2N)  +  1.25N  +  0(1);  (54) 


V")  =  l  N(log2N)2  '  3  N(log2N)  +  jN  +  0(1). 


Since  the  leading  coefficient  for  8^(1*)  exceeds  that  for  both  S2(n) 
and  S^(N),  it  is  not  surprising  that  S^N)  18  generally  larger  than 
the  other  two.  Also,  although  the  leading  coefficient  is  J  for  both 
S2(n)  and  S^(N),  the  coefficient  for  the  term  N(log2N)  is  smaller 
for  S^(N).  This  explains  why  ;^(N)  is  smaller  than  Sg(N)  for 
sufficiently  large  N,  that  is,  for  N  >  8. 


From  (50)  and  (53)  we  observe  that  the  coefficient  for  the  term 

2 

l^log^N)  in  the  expansion  of  8^(n)  is 


ag(log2g)”2  -  *[,,„]■ 


When  g  =  2r,  we  may  use  (4l)  to  s.iow  that  (55)  reduces  to  However, 

2 

for  all  [g,g]  f-networks  known,  ar  ,  >  i{log  g)  if  g  is  not  a 

Lg>£TJ  £  c 

power  of  2.  Therefore,  the  leading  coefficient  in  the  expansion  of  S  (n) 
is  minimized  -  -  and  its  value  is  £  *•  -  if  and  only  if  g  is  a  power  of  2. 

In  view  of  the  above  observations  we  conclude  that  the  most  economical 
N-sorter  networks  are  achieved  when  g  is  a  power  of  2.  Furthermore,  we 
might  hope  to  achieve  successive  reductions  in  the  asymptotic  growth  of 
S^(N)  by  choosing  g  =  2*  (whi^h  maintains  a  leading  coefficient  of  ■£), 
for  successively  larger  r.  Therefore,  we  use  (50)“(53 )t  along  with 

*[8, b[8,6]>  *[16,16]’  “d  V,l6]  (U)  «“  («).  ■“ 

with  S(8)  =  19  and  S(l6)  =  60  obtained  from  Table  2,  to  derive 


Sg(N)  =  £  N(log2N)2  -  i  N(loggN)  +  |  N  +  O(l). 

(56) 

S16(N)  “  T7  N(log2N)2  -  i(g)  N(log2N)  +  J|N  +  0(1). 

Comparing  these  results  with  (54)  we  observe  that  for  sufficiently  large 
N,  Sq(h)  >  S^(N)  >Sl6(H).  And,  trying  «  =  2*  for  successively  larger 
values  of  r,  we  find  that  successive  improvements  occur  only  when  r  is 
itself  a  power  of  2,  so  that  the  first  improvement  over  8^(1*)  occurs 
a  2^  a  8,  or  g  *  2  ■  256. 


when  r 


55 


(60) 


From  (43)  and  (52)  we  obtain 
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VII.  Conclusion 

The  strategy  used  by  most  of  the  best  previous  N-sorter  networks 
is  to  divide  the  N  inputs  into  2  groups,  sort  these  groups  separately, 
and  then  merge  the  results.  The  best  merge  networks,  suggested  by 
K.  E.  Batcher  [  4  ],  partition  each  sorted  multiset  into  2  divisions,  merge 
each  division  of  the  first  sorted  multiset  with  one  of  the  second,  and 
then  use  £n-1  comparators  to  resolve  the  remaining  ambiguities. 

Our  results  demonstrate  that  greater  economy  can  be  achieved  in 
N-sorter  networks  by  dividing  the  N  inputs  into  g  >  2  groups  that 
are  sorted  separately,  and  by  partitioning  each  sorted  multiset  into  d  >  2 
divisions  to  be  combined  by  d  merge  networks.  In  particular,  we  have 
shown  that  by  using  g  =  d  =  4,  we  can  achieve  N-sorter  networks  that 
are  more  economical  than  Batcher's  for  N  >  6  and  that  are  more  economi¬ 
cal  than  any  networks  previously  designed  for  N  >  34.  We  have  indicated 

r 

that  even  greater  savings  can  be  achieved  by  using  g  =  d  <=  2  ,  where 
r  =  2S  >  4;  however,  these  constructions  are  only  applicable  for 
N  >  g*d  =  4r. 

Our  N-sorter  networks  require  order  N^og^N)  fewer  comparators 
than  the  best  previous  networks.  However,  we  have  not  been  able  to  im- 
prove  upon  the  asymptotic  growth  rate  of  iN(loggN)^  achieved  with 
Batcher’s  construction.  As  noted  above,  the  coefficient  for  the  term 

O 

N(log-N)  with  our  construction  is  given  by 


ag(lo*2g)" 


Since  ar  ,  is  (l/g)  times  the  number  of  templates  required  by  the 

L8>gJ 


(  X  <*• 


j  -  _ .  *  f,  * 
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[g,g]  f-network,  we  could  reduce  the  coefficient  of  the  tern  H(loggN) 
by  constructing  an  Improved  [g,g]  f-network  that  required  fewer  than 
&(log2g)  templates.  However,  In  Appendix  B  we  show  that 


*[g#d](»)  >  (1  "  d"1)  N  -  (d  -  1), 


(64) 


so  that  ar  •■  >  0  and 
require  order 


ag  >  0.  Therefore,  the  [g,d]  strategy  must 
comparators. 


Appendix  A:  Proof  of  Theorem  3 


The  proof  of  Theorem  3  is  facilitated  by  the  following  two  lemmas. 


Lemma  1 : 

T'  the  rows  V,.  v  of  an  r  x  d  array  V  are  ordered,  then  the 
columns  V,  .  are  also  ordered  if  and  only  if 


n(r,*)  5  "(r-l,*)  —  *•*  -  “(l,*) 


(65) 


Proof:  Lemma  1  follows  from  Theorem  1  by  symmetry. 


Lemma  2: 


Suppose  that  the  t  x  8  planes  V,  v  of  the  t  x  s  x  d  array 

V 

V  are  ordered.  Then  if  we  sort  the  t  d  planes  ^  the  t  x  s 
planes  remain  ordered. 


Proof: 

Assume  that  V.  .  is  ordered,  1  <  k  <  d.  Then  the  columns 

—  — 

V,  »  1  <  i  <  t,  and  the  verticals  V,  .s,  1  <  j  <  s,  are  also 

ordered,  since  they  are  all  submultiset?  of  #  Therefore,  by 

Theorem  1, 

>,.,10  s  ”(.,.-i,k)S  •••  ^  V.i.k)  • 


:«) 


Summing  (66)  for  1  <  k  <  d  we  find  that 


(*,»,»)  i  •••  -"(*,1,*) 


<  ...  <  n. 


(67) 


6o 


Now  suppose  we  sort  the  t  x  d  Dianes  V.  .  ,  1  <  j  <  s.  We 

( ~  ~ 

need  to  show  that  this  operation  leaves  the  planes  V ^  #  j  ordered. 
Clearly  sorting  V,„  .  does  no?:  alter  the  number  of  0's  in  V, 
therefore,  n,  v  satisfies  (67)  after  the  t  x  d  planes  are  sorted. 
Also,  once  ^  ^  is  sorted,  the  n^  j  0's  are  divided  among  the 

rows  V,  %  according  to 


(M,*) 


0  «  »(w)  e  [t>, 

-  (‘-Do  11  «  td-D-.io];  <«> 


if  n. 


Equations  (67)  and  (68)  together  imply  that 


D(t,s,*)  n(t,s-l,*)  <  ^  E(t,l,*)  ^  “(t-1,8,*)  ^ 


n(t-l,s-l,*)  ^ 


<  . ..  <  n 


(69) 


We  can  consider  V  to  be  a  ts  x  d  array  with  ts  "rows”  v^t 
1  <  i  <  t,  i.  <  j  <  s,  and  with  d  "columns"  v(*,*ik)*  1  -  k  -  d' 
Sorting  V-  \  orders  the  "rows"  V,,  ,  because  they  are  all  sub- 

multirets  of  If  V(*^k)  i«  initially  ordered,  then  sorting 

V(*  J  *)  lead*  to  N°W  by  Le“B*  l’  lf  the  ,  rcWi"  V(i,J,*)  *” 

ordered  and  n^  ^  ^  satisfies  (69)..  then  the  "columns"  »« 

also  ordered.  Therefore,  if  v(*#*;k)  *•  ordered,  then  sorting 

vc,j,*)  V,*,*)  ort’”d' 

Q.E.D. 
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We  are  now  ready  to  prove  Theorem  3, 


Theorem  3: 

Let  the  multiset  V  =  £vi»v2> where  N  =  tsd,  be  considered 
a  t  X  s  x  d  array.  Then  the  following  small  f-networks  together 
constitute  an  [sg,d]  f-network  for  V. 

0  8  [g,d]  f-networks  for  V ^  j  1  <  J  <  s;  followed  by 

ii)  one  [s,d]  f- network  for  V. 


Proof: 

According  to  Definition  1,  tie  sequence  of  comparators  represented 

by  i)  and  ii)  is  an  [sg,d]  f-  network  for  V  if  and  only  if  it 

will  complete  the  ordering  of  V  given  that:  a)  the  d  planes 

1  <  k  <  d,  are  ordered;  and  b)  n,  .  satisfies 

{*,■*,&)  —  —  [*,*,*) 


n(*,*,d)  ^  -  “(*,*, 1)  -  “(*,*,<!)  + 


(yc) 


Let  us  assume  that  the  partial  ordering  in  v  satisfies  con¬ 
ditions  a)  and  b).  Then,  since  W.  »  is  ordered,  each  of  the 

\  t  >*) 

submultisets  w,  .  N,  1  <  j  <  s,  is  also  ordered.  In  addition,  the  dis- 
tributlon  of  the  n .  .  0's  among  the  verticals  V,„  .  .  v  satisfies 

+  •  “  J)  /  8j 

We  may  use  (70)  and  (71)  to  show  that 

V,J,d)  -  D(*,J,d-l)  -  **  -  +  ** 
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Since  the  d  submultisets  V/Jt  .  ...  1  <  k  <  d,  of  V/jL  .  are  ordered, 
and  since  n,  .  satisfies  (72),  a  [g,d]  2- network  will  order 


and  since  n,  .  satisfies  u2),  a  [g,d]  2- network  will  order 

v(*,s,*y 

Lemma  2  implies  that  the  [g,d]  f- networks  that  order  the  t  x  d 
planes  j  leave  the  t  x  s  planes  #  ^  ordered.  Furthermore, 

once  V/„  .  is  ordered,  the  distribution  of  the  n,  .  0's  among 
the  verticals  V/„  .  .  .  satisfies 


=  +  d  -  k)  /  dJ 


Equation  (73)  implies  that 


V,.i,<0  "  "(M.H-l)  -  ••  -  -  n(*,J,d) 


Summing  (74)  for  1  <  j  <  s  we  obtain 


K(*,*,d)  -  “(*,*, d-1)  -  ’ 


,.  <  n 


5  “(*,*,!)  $  n(*,sd) 


Therefore,  since  the  [g,d]  f- networks  for  V,  .  leave  the  planes 
v(*,*,k)  ord«red,  (75)  guarantees  that  an  [s,d]  f-network  will  complete 
the  ordering  of  V. 

We  have  seen  that  if  tne  partial  ordering  in  V  satisfies  con¬ 
ditions  a)  and  b),  then  the  s  [g,d]  f-networka  in  i)  followed 
by  the  [s,d]  f-network  in  ii)  will  complete  the  ordering  of  V. 
Therefore,  i)  and  ii)  together  constitute  an  [sg,d]  f-network. 

Q.R.D. 


mmmB. 


,  e  **<,+*.■ 


r  -A,  '  _  -  •  '*  "i.  M 


Appendix  B:  A  Lower  Bound  for  f. 


[8,0] w 

In  this  appendix  we  calculate  a  lower  bound  for  fr  (N; ,  the 

U>d] 

number  of  comparators  required  by  the  most  efficient  [g,d]  f-network 

for  the  set  V  =  {v,  ,v_, . . . ,  v  ]  ,  where  N  =.  td,  t  g  >  ?.  Let  r 

and  s  be  any  two  integers  satisfying  1  <  r  <  t-1,  2  •'  s  <  d.  Then, 

by  definition,  a  [g.d]  f-network  will  complete  the  ordering  of  V  if 

the  columns  Y,  >,  1  <  j  <  d,  are  ordered  and  if 

-  — 


n  --- r+1; 


"(** j;  '  r’ 


2  <.  j  s-l; 


V,!)"1'1'  s£J-d- 


From  (76) we  see  that  V  is  ordered  except  that  the  0  at  position 

V/  ,  \  should  be  moved  to  V  v.  Since  a  fgjdl  f-network  will 

(r+1,1  )  {T,s)  LS>  J 

complete  the  ordering  of  V,  it  must  include  a  comparator  or  a  sequence 


of  comparators  that  provide  a  path  from  ^ 


to  V 


(r>8)' 


Now  a  comparator  can  only  move  a  0  in  one  position  of  V  to  a 
position  labeled  by  a  smaller  index.  Therefore,  a  [g,d]  f-network  must 
contain  either  the  comparator  g):V(r+1  ^  or  else  the  comparator 

V,  siV,  where  s  <  J  <  d  and  where  the  f-network  includes  a  path 

from  V^x  1>  to  jj.  Sine*  r  and  s  are  arbitrary  integers 

satisfying  1  <  r  <  t-1,  2  <  *  <  d,  we  have  shown  that 


*rg>d-j{td)  >  (t-l)(d-l.,  g,d>?, 


or,  using  N  =  td, 


f[g,d](N)  ^  (1  -  d-1)  N  -  (d  -  1),  g,d<2.  (78 

For  g  >  2  and/or  d  >  2  (78)  does  not  provide  a  very  tight  bound 
for  fj-g  ^(N);  ^-ndeed,  it  is  not  at  all  the  greatest  lower  bound  known. 
However,  (78)  is  sufficient  to  show  that  ar  ,  >  0  and  a  >  0,  so 

LgjgJ  g 

that  th  ’  number  of  comparators  required  by  a  [g,d]  N-sorter  network  grows 
as  N(log2N)2. 


’  .-,  k>  X/$  A -V-tVR. ». 


imwwpsp^.!^ ,« 
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